Proactive-Reactive Prediction for Data Streams
Authors
Abstract
Prediction in streaming data is an important activity in various branches of science such as sociology, economics and politics. Data streams pose two major challenges: (1) the underlying concept of the data may change over time; and (2) the data may grow without limit, so that it is difficult to retain a long history of raw data. Previous research has mainly focused on manipulating relatively recent data. The contribution of this paper is threefold. First, it uses a measure of conceptual equivalence to organize the data history into a history of concepts. Transition patterns among concepts can be learned from this history to help prediction. Second, it carries out prediction at two levels: a general level of predicting each oncoming concept and a specific level of predicting each instance's class. Third, it proposes a system, RePro, that incorporates reactive and proactive mechanisms to predict in streaming data effectively and efficiently. Experiments compare RePro with representative existing prediction methods on various benchmark data sets that represent diverse scenarios of concept change. The empirical evidence offers insights and suggests that the proposed methodology is a sound solution to prediction for data streams.
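The idea of learning transition patterns from a history of concepts can be illustrated with a minimal sketch. The abstract does not say how RePro represents or learns these patterns, so everything below is an assumption made for illustration: concepts are reduced to plain labels, transitions are modeled as first-order successor counts, and the names `ConceptTransitionModel`, `predict_next` and `choose_concept` are hypothetical rather than taken from the paper.

```python
from collections import Counter, defaultdict


class ConceptTransitionModel:
    """First-order successor counts over a history of concept labels.

    Illustrative assumption only; not the mechanism described in the paper.
    """

    def __init__(self):
        # counts[prev][nxt] = number of times concept `prev` was followed by `nxt`
        self.counts = defaultdict(Counter)

    def fit(self, concept_history):
        # Walk over consecutive pairs of concepts in the stored history.
        for prev, nxt in zip(concept_history, concept_history[1:]):
            self.counts[prev][nxt] += 1
        return self

    def predict_next(self, current):
        """Proactive step: most frequent successor of `current`, or None if unseen."""
        followers = self.counts.get(current)
        if not followers:
            return None
        return followers.most_common(1)[0][0]


def choose_concept(model, current, reactive_guess):
    """Prefer the proactive prediction; fall back to a reactive guess otherwise."""
    predicted = model.predict_next(current)
    return predicted if predicted is not None else reactive_guess


# Hypothetical concept history, one label per stable period of the stream.
history = ["A", "B", "A", "B", "C", "A", "B"]
model = ConceptTransitionModel().fit(history)
print(model.predict_next("A"))
print(choose_concept(model, "D", reactive_guess="A"))
```

In this sketch the stored history contains only concept labels, not raw instances, which mirrors the abstract's point that a long history of raw data is hard to retain. The reactive fallback stands in for whatever a system would do once a change has been detected and no learned pattern applies.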
Similar resources
Proactive and reactive multi-dimensional histogram maintenance for selectivity estimation
Many state-of-the-art selectivity estimation methods use query feedback to maintain histogram buckets, thereby using the limited memory efficiently. However, they are “reactive” in nature, that is, they update the histogram based on queries that have come to the system in the past for evaluation. In some applications, future occurrences of certain queries may be predicted and a “proactive” ...
Uncertain Resource Availabilities: Proactive and Reactive Procedures for Preemptive Resource Constrained Project Scheduling Problem
Project scheduling is the part of project management that deals with determining when in time to start (and finish) which activities and with the allocation of scarce resources to the project activities. In practice, virtually all project managers are confronted with resource scarceness. In such cases, the Resource-Constrained Project Scheduling Problem (RCPSP) arises. This optimization problem has...
Proposed Feature Selection for Dynamic Thermal Management in Multicore Systems
Increasing the number of cores to meet the demand for more computing power has raised the processor temperature of multi-core systems. One of the main approaches to reducing temperature is dynamic thermal management. These methods are divided into two classes, reactive and proactive. Proactive methods manage the processor temperature, by forecasting the temperature be...
Characterizing Drifts for Proactive Drift Detection in Data Streams
The evolution of data, such as changes in the underlying model known as concept drift, presents many challenges for data stream research. Currently most drift detection methods are able to locate the point of change, but are unable to provide meaningful information on the characteristics of change or utilize historical trends. In this thesis, we investigate two streams of research: (1) the magnitu...